Overview
Brought to you by YData
Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 40541 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 19.4 MiB |
| Average record size in memory | 501.5 B |
Variable types
| Text | 6 |
|---|---|
| Numeric | 6 |
| DateTime | 1 |
latitude is highly overall correlated with temperature_celsius | High correlation |
temperature_celsius is highly overall correlated with latitude | High correlation |
precip_mm is highly skewed (γ1 = 21.67108377) | Skewed |
precip_mm has 27994 (69.1%) zeros | Zeros |
Reproduction
| Analysis started | 2025-04-06 20:56:42.993772 |
|---|---|
| Analysis finished | 2025-04-06 20:56:46.759335 |
| Duration | 3.77 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
country
Text
| Distinct | 185 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 24 |
| Mean length | 8.6122197 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Afghanistan |
|---|---|
| 2nd row | Albania |
| 3rd row | Algeria |
| 4th row | Andorra |
| 5th row | Angola |
| Value | Count | Frequency (%) |
| and | 1040 | 2.1% |
| islands | 832 | 1.6% |
| republic | 832 | 1.6% |
| united | 624 | 1.2% |
| guinea | 624 | 1.2% |
| saint | 624 | 1.2% |
| bulgaria | 560 | 1.1% |
| indonesia | 426 | 0.8% |
| south | 416 | 0.8% |
| of | 416 | 0.8% |
| Other values (202) | 44129 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 55462 | |
| i | 31264 | 9.0% |
| n | 28268 | 8.1% |
| e | 23378 | 6.7% |
| r | 19595 | 5.6% |
| o | 16472 | 4.7% |
| u | 13981 | 4.0% |
| l | 13081 | 3.7% |
| t | 12908 | 3.7% |
| s | 12102 | 3.5% |
| Other values (42) | 122637 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 349148 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 55462 | |
| i | 31264 | 9.0% |
| n | 28268 | 8.1% |
| e | 23378 | 6.7% |
| r | 19595 | 5.6% |
| o | 16472 | 4.7% |
| u | 13981 | 4.0% |
| l | 13081 | 3.7% |
| t | 12908 | 3.7% |
| s | 12102 | 3.5% |
| Other values (42) | 122637 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 349148 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 55462 | |
| i | 31264 | 9.0% |
| n | 28268 | 8.1% |
| e | 23378 | 6.7% |
| r | 19595 | 5.6% |
| o | 16472 | 4.7% |
| u | 13981 | 4.0% |
| l | 13081 | 3.7% |
| t | 12908 | 3.7% |
| s | 12102 | 3.5% |
| Other values (42) | 122637 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 349148 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 55462 | |
| i | 31264 | 9.0% |
| n | 28268 | 8.1% |
| e | 23378 | 6.7% |
| r | 19595 | 5.6% |
| o | 16472 | 4.7% |
| u | 13981 | 4.0% |
| l | 13081 | 3.7% |
| t | 12908 | 3.7% |
| s | 12102 | 3.5% |
| Other values (42) | 122637 |
location_name
Text
| Distinct | 220 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 15 |
| Mean length | 7.6521053 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Kabul |
|---|---|
| 2nd row | Tirana |
| 3rd row | Algiers |
| 4th row | Andorra La Vella |
| 5th row | Luanda |
| Value | Count | Frequency (%) |
| city | 1040 | 2.2% |
| port | 832 | 1.8% |
| san | 624 | 1.3% |
| saint | 416 | 0.9% |
| beirut | 209 | 0.4% |
| tbilisi | 209 | 0.4% |
| vienna | 208 | 0.4% |
| baku | 208 | 0.4% |
| dhaka | 208 | 0.4% |
| gaborone | 208 | 0.4% |
| Other values (231) | 42422 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 46563 | |
| o | 22382 | 7.2% |
| i | 21582 | 7.0% |
| n | 20478 | 6.6% |
| r | 18849 | 6.1% |
| e | 17231 | 5.6% |
| u | 13742 | 4.4% |
| s | 12501 | 4.0% |
| t | 12266 | 4.0% |
| l | 10448 | 3.4% |
| Other values (44) | 114182 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 310224 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 46563 | |
| o | 22382 | 7.2% |
| i | 21582 | 7.0% |
| n | 20478 | 6.6% |
| r | 18849 | 6.1% |
| e | 17231 | 5.6% |
| u | 13742 | 4.4% |
| s | 12501 | 4.0% |
| t | 12266 | 4.0% |
| l | 10448 | 3.4% |
| Other values (44) | 114182 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 310224 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 46563 | |
| o | 22382 | 7.2% |
| i | 21582 | 7.0% |
| n | 20478 | 6.6% |
| r | 18849 | 6.1% |
| e | 17231 | 5.6% |
| u | 13742 | 4.4% |
| s | 12501 | 4.0% |
| t | 12266 | 4.0% |
| l | 10448 | 3.4% |
| Other values (44) | 114182 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 310224 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 46563 | |
| o | 22382 | 7.2% |
| i | 21582 | 7.0% |
| n | 20478 | 6.6% |
| r | 18849 | 6.1% |
| e | 17231 | 5.6% |
| u | 13742 | 4.4% |
| s | 12501 | 4.0% |
| t | 12266 | 4.0% |
| l | 10448 | 3.4% |
| Other values (44) | 114182 |
latitude
Real number (ℝ)
High correlation 
| Distinct | 216 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.298092 |
| Minimum | -41.3 |
|---|---|
| Maximum | 64.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 8909 |
| Negative (%) | 22.0% |
| Memory size | 316.9 KiB |
Quantile statistics
| Minimum | -41.3 |
|---|---|
| 5-th percentile | -24.65 |
| Q1 | 3.75 |
| median | 17.25 |
| Q3 | 41.32 |
| 95-th percentile | 53.9 |
| Maximum | 64.1 |
| Range | 105.4 |
| Interquartile range (IQR) | 37.57 |
Descriptive statistics
| Standard deviation | 24.52135 |
|---|---|
| Coefficient of variation (CV) | 1.2706619 |
| Kurtosis | -0.76273504 |
| Mean | 19.298092 |
| Median Absolute Deviation (MAD) | 20.63 |
| Skewness | -0.30646662 |
| Sum | 782363.95 |
| Variance | 601.29658 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 39.93 | 416 | 1.0% |
| 41.9 | 416 | 1.0% |
| 12.15 | 415 | 1.0% |
| 40.4 | 414 | 1.0% |
| 41.73 | 209 | 0.5% |
| 33.87 | 209 | 0.5% |
| 42 | 208 | 0.5% |
| -25.97 | 208 | 0.5% |
| -22.57 | 208 | 0.5% |
| 39.55 | 208 | 0.5% |
| Other values (206) | 37630 |
| Value | Count | Frequency (%) |
| -41.3 | 207 | |
| -35.28 | 208 | |
| -34.86 | 208 | |
| -34.59 | 208 | |
| -33.45 | 208 | |
| -29.32 | 208 | |
| -26.32 | 182 | |
| -25.97 | 208 | |
| -25.75 | 208 | |
| -24.65 | 208 |
| Value | Count | Frequency (%) |
| 64.1 | 25 | 0.1% |
| 63.83 | 183 | |
| 60.18 | 208 | |
| 59.92 | 208 | |
| 59.43 | 208 | |
| 59.33 | 208 | |
| 56.95 | 208 | |
| 55.75 | 208 | |
| 55.67 | 208 | |
| 54.68 | 208 |
longitude
Real number (ℝ)
| Distinct | 217 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21.759817 |
| Minimum | -175.2 |
|---|---|
| Maximum | 179.22 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 11252 |
| Negative (%) | 27.8% |
| Memory size | 316.9 KiB |
Quantile statistics
| Minimum | -175.2 |
|---|---|
| 5-th percentile | -84.08 |
| Q1 | -6.84 |
| median | 23.24 |
| Q3 | 49.88 |
| 95-th percentile | 147.19 |
| Maximum | 179.22 |
| Range | 354.42 |
| Interquartile range (IQR) | 56.72 |
Descriptive statistics
| Standard deviation | 65.682563 |
|---|---|
| Coefficient of variation (CV) | 3.0185255 |
| Kurtosis | 0.34382658 |
| Mean | 21.759817 |
| Median Absolute Deviation (MAD) | 28.09 |
| Skewness | 0.011654161 |
| Sum | 882164.74 |
| Variance | 4314.1991 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14.51 | 416 | 1.0% |
| 12.45 | 416 | 1.0% |
| -58.17 | 416 | 1.0% |
| 44.79 | 209 | 0.5% |
| 35.51 | 209 | 0.5% |
| 4.89 | 208 | 0.5% |
| 19.26 | 208 | 0.5% |
| -6.84 | 208 | 0.5% |
| 32.59 | 208 | 0.5% |
| 17.08 | 208 | 0.5% |
| Other values (207) | 37835 |
| Value | Count | Frequency (%) |
| -175.2 | 208 | |
| -171.73 | 206 | |
| -123.04 | 21 | 0.1% |
| -120.49 | 187 | |
| -99.13 | 208 | |
| -90.53 | 208 | |
| -89.2 | 208 | |
| -88.77 | 208 | |
| -87.22 | 208 | |
| -86.27 | 207 |
| Value | Count | Frequency (%) |
| 179.22 | 207 | |
| 178.42 | 208 | |
| 174.78 | 207 | |
| 171.38 | 208 | |
| 169.53 | 206 | |
| 168.32 | 208 | |
| 159.95 | 208 | |
| 158.15 | 208 | |
| 149.22 | 208 | |
| 147.19 | 208 |
last_updated
Date
| Distinct | 6021 |
|---|---|
| Distinct (%) | 14.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 316.9 KiB |
| Minimum | 2023-08-29 02:45:00 |
|---|---|
| Maximum | 2024-03-29 05:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
temperature_celsius
Real number (ℝ)
High correlation 
| Distinct | 628 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.193458 |
| Minimum | -41.9 |
|---|---|
| Maximum | 45.4 |
| Zeros | 359 |
| Zeros (%) | 0.9% |
| Negative | 1932 |
| Negative (%) | 4.8% |
| Memory size | 316.9 KiB |
Quantile statistics
| Minimum | -41.9 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 12 |
| median | 22 |
| Q3 | 27 |
| 95-th percentile | 32 |
| Maximum | 45.4 |
| Range | 87.3 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 10.71348 |
|---|---|
| Coefficient of variation (CV) | 0.55818394 |
| Kurtosis | 0.98066486 |
| Mean | 19.193458 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | -0.91195662 |
| Sum | 778122 |
| Variance | 114.77866 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 27 | 1906 | 4.7% |
| 26 | 1800 | 4.4% |
| 28 | 1780 | 4.4% |
| 29 | 1655 | 4.1% |
| 25 | 1616 | 4.0% |
| 30 | 1436 | 3.5% |
| 24 | 1050 | 2.6% |
| 31 | 988 | 2.4% |
| 23 | 859 | 2.1% |
| 14 | 832 | 2.1% |
| Other values (618) | 26619 |
| Value | Count | Frequency (%) |
| -41.9 | 1 | < 0.1% |
| -39.4 | 1 | < 0.1% |
| -39 | 1 | < 0.1% |
| -38.3 | 1 | < 0.1% |
| -38 | 2 | |
| -37.6 | 1 | < 0.1% |
| -37 | 1 | < 0.1% |
| -36 | 4 | |
| -35.7 | 1 | < 0.1% |
| -35.6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 45.4 | 1 | < 0.1% |
| 45 | 1 | < 0.1% |
| 44.9 | 1 | < 0.1% |
| 44.3 | 1 | < 0.1% |
| 44 | 3 | |
| 43.9 | 1 | < 0.1% |
| 43.8 | 1 | < 0.1% |
| 43.6 | 1 | < 0.1% |
| 43.3 | 1 | < 0.1% |
| 43.2 | 1 | < 0.1% |
precip_mm
Real number (ℝ)
Skewed  Zeros 
| Distinct | 435 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.13340643 |
| Minimum | 0 |
|---|---|
| Maximum | 39.64 |
| Zeros | 27994 |
| Zeros (%) | 69.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 316.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0.02 |
| 95-th percentile | 0.77 |
| Maximum | 39.64 |
| Range | 39.64 |
| Interquartile range (IQR) | 0.02 |
Descriptive statistics
| Standard deviation | 0.6168834 |
|---|---|
| Coefficient of variation (CV) | 4.6240906 |
| Kurtosis | 891.5391 |
| Mean | 0.13340643 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 21.671084 |
| Sum | 5408.43 |
| Variance | 0.38054513 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 27994 | |
| 0.01 | 1925 | 4.7% |
| 0.02 | 1026 | 2.5% |
| 0.03 | 688 | 1.7% |
| 0.04 | 504 | 1.2% |
| 0.1 | 439 | 1.1% |
| 0.05 | 417 | 1.0% |
| 0.06 | 340 | 0.8% |
| 0.07 | 324 | 0.8% |
| 0.08 | 271 | 0.7% |
| Other values (425) | 6613 | 16.3% |
| Value | Count | Frequency (%) |
| 0 | 27994 | |
| 0.01 | 1925 | 4.7% |
| 0.02 | 1026 | 2.5% |
| 0.03 | 688 | 1.7% |
| 0.04 | 504 | 1.2% |
| 0.05 | 417 | 1.0% |
| 0.06 | 340 | 0.8% |
| 0.07 | 324 | 0.8% |
| 0.08 | 271 | 0.7% |
| 0.09 | 247 | 0.6% |
| Value | Count | Frequency (%) |
| 39.64 | 1 | |
| 31 | 1 | |
| 28.7 | 1 | |
| 21.68 | 1 | |
| 19.6 | 1 | |
| 18.63 | 1 | |
| 17.73 | 1 | |
| 17.7 | 1 | |
| 17.69 | 1 | |
| 15.07 | 1 |
humidity
Real number (ℝ)
| Distinct | 98 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 70.265805 |
| Minimum | 3 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 316.9 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 27 |
| Q1 | 59 |
| median | 75 |
| Q3 | 87 |
| 95-th percentile | 97 |
| Maximum | 100 |
| Range | 97 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 21.085619 |
|---|---|
| Coefficient of variation (CV) | 0.30008365 |
| Kurtosis | 0.30464366 |
| Mean | 70.265805 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.89058337 |
| Sum | 2848646 |
| Variance | 444.60335 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 94 | 2519 | 6.2% |
| 100 | 1816 | 4.5% |
| 79 | 1552 | 3.8% |
| 89 | 1431 | 3.5% |
| 84 | 1425 | 3.5% |
| 93 | 1386 | 3.4% |
| 75 | 1234 | 3.0% |
| 70 | 1146 | 2.8% |
| 87 | 1099 | 2.7% |
| 88 | 1072 | 2.6% |
| Other values (88) | 25861 |
| Value | Count | Frequency (%) |
| 3 | 4 | < 0.1% |
| 4 | 22 | 0.1% |
| 5 | 37 | |
| 6 | 62 | |
| 7 | 53 | |
| 8 | 66 | |
| 9 | 79 | |
| 10 | 54 | |
| 11 | 73 | |
| 12 | 89 |
| Value | Count | Frequency (%) |
| 100 | 1816 | |
| 99 | 45 | 0.1% |
| 98 | 104 | 0.3% |
| 97 | 118 | 0.3% |
| 96 | 113 | 0.3% |
| 95 | 123 | 0.3% |
| 94 | 2519 | |
| 93 | 1386 | |
| 92 | 225 | 0.6% |
| 91 | 155 | 0.4% |
air_quality_PM2.5
Real number (ℝ)
| Distinct | 2260 |
|---|---|
| Distinct (%) | 5.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.23774 |
| Minimum | 0.5 |
|---|---|
| Maximum | 1558.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 316.9 KiB |
Quantile statistics
| Minimum | 0.5 |
|---|---|
| 5-th percentile | 0.6 |
| Q1 | 2.5 |
| median | 7.5 |
| Q3 | 23 |
| 95-th percentile | 97 |
| Maximum | 1558.8 |
| Range | 1558.3 |
| Interquartile range (IQR) | 20.5 |
Descriptive statistics
| Standard deviation | 63.649937 |
|---|---|
| Coefficient of variation (CV) | 2.5220142 |
| Kurtosis | 104.84917 |
| Mean | 25.23774 |
| Median Absolute Deviation (MAD) | 6.2 |
| Skewness | 8.5194913 |
| Sum | 1023163.2 |
| Variance | 4051.3145 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.5 | 1941 | 4.8% |
| 0.9 | 479 | 1.2% |
| 1 | 478 | 1.2% |
| 1.1 | 447 | 1.1% |
| 0.8 | 442 | 1.1% |
| 1.3 | 430 | 1.1% |
| 1.5 | 430 | 1.1% |
| 0.6 | 428 | 1.1% |
| 1.2 | 427 | 1.1% |
| 0.7 | 427 | 1.1% |
| Other values (2250) | 34612 |
| Value | Count | Frequency (%) |
| 0.5 | 1941 | |
| 0.6 | 428 | 1.1% |
| 0.7 | 427 | 1.1% |
| 0.8 | 442 | 1.1% |
| 0.9 | 479 | 1.2% |
| 1 | 478 | 1.2% |
| 1.1 | 447 | 1.1% |
| 1.2 | 427 | 1.1% |
| 1.3 | 430 | 1.1% |
| 1.4 | 424 | 1.0% |
| Value | Count | Frequency (%) |
| 1558.8 | 1 | |
| 1329.2 | 1 | |
| 1253.9 | 1 | |
| 1233 | 1 | |
| 1199.1 | 1 | |
| 1179.3 | 1 | |
| 1163.5 | 1 | |
| 1160.2 | 1 | |
| 1146.7 | 1 | |
| 1133 | 1 |
| Distinct | 185 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 24 |
| Mean length | 8.6022545 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | afghanistan |
|---|---|
| 2nd row | albania |
| 3rd row | algeria |
| 4th row | andorra |
| 5th row | angola |
| Value | Count | Frequency (%) |
| and | 1040 | 2.1% |
| islands | 832 | 1.6% |
| republic | 832 | 1.6% |
| united | 624 | 1.2% |
| guinea | 624 | 1.2% |
| saint | 624 | 1.2% |
| bulgaria | 560 | 1.1% |
| indonesia | 426 | 0.8% |
| south | 416 | 0.8% |
| of | 416 | 0.8% |
| Other values (202) | 44129 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 58790 | |
| i | 34060 | 9.8% |
| n | 30552 | 8.8% |
| e | 25042 | 7.2% |
| r | 21384 | 6.1% |
| s | 17998 | 5.2% |
| o | 16680 | 4.8% |
| t | 15597 | 4.5% |
| u | 15437 | 4.4% |
| l | 15358 | 4.4% |
| Other values (17) | 97846 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 348744 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 58790 | |
| i | 34060 | 9.8% |
| n | 30552 | 8.8% |
| e | 25042 | 7.2% |
| r | 21384 | 6.1% |
| s | 17998 | 5.2% |
| o | 16680 | 4.8% |
| t | 15597 | 4.5% |
| u | 15437 | 4.4% |
| l | 15358 | 4.4% |
| Other values (17) | 97846 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 348744 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 58790 | |
| i | 34060 | 9.8% |
| n | 30552 | 8.8% |
| e | 25042 | 7.2% |
| r | 21384 | 6.1% |
| s | 17998 | 5.2% |
| o | 16680 | 4.8% |
| t | 15597 | 4.5% |
| u | 15437 | 4.4% |
| l | 15358 | 4.4% |
| Other values (17) | 97846 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 348744 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 58790 | |
| i | 34060 | 9.8% |
| n | 30552 | 8.8% |
| e | 25042 | 7.2% |
| r | 21384 | 6.1% |
| s | 17998 | 5.2% |
| o | 16680 | 4.8% |
| t | 15597 | 4.5% |
| u | 15437 | 4.4% |
| l | 15358 | 4.4% |
| Other values (17) | 97846 |
City_normalized
Text
| Distinct | 220 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 15 |
| Mean length | 7.6014898 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | kabul |
|---|---|
| 2nd row | tirana |
| 3rd row | algiers |
| 4th row | andorra la vella |
| 5th row | luanda |
| Value | Count | Frequency (%) |
| city | 1040 | 2.2% |
| port | 832 | 1.8% |
| san | 624 | 1.3% |
| saint | 416 | 0.9% |
| beirut | 209 | 0.4% |
| tbilisi | 209 | 0.4% |
| vienna | 208 | 0.4% |
| baku | 208 | 0.4% |
| dhaka | 208 | 0.4% |
| gaborone | 208 | 0.4% |
| Other values (231) | 42422 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 50656 | |
| o | 23214 | 7.5% |
| n | 22973 | 7.5% |
| i | 22026 | 7.1% |
| r | 19934 | 6.5% |
| e | 17356 | 5.6% |
| s | 16661 | 5.4% |
| t | 14361 | 4.7% |
| u | 13948 | 4.5% |
| l | 13369 | 4.3% |
| Other values (17) | 93674 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 308172 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 50656 | |
| o | 23214 | 7.5% |
| n | 22973 | 7.5% |
| i | 22026 | 7.1% |
| r | 19934 | 6.5% |
| e | 17356 | 5.6% |
| s | 16661 | 5.4% |
| t | 14361 | 4.7% |
| u | 13948 | 4.5% |
| l | 13369 | 4.3% |
| Other values (17) | 93674 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 308172 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 50656 | |
| o | 23214 | 7.5% |
| n | 22973 | 7.5% |
| i | 22026 | 7.1% |
| r | 19934 | 6.5% |
| e | 17356 | 5.6% |
| s | 16661 | 5.4% |
| t | 14361 | 4.7% |
| u | 13948 | 4.5% |
| l | 13369 | 4.3% |
| Other values (17) | 93674 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 308172 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 50656 | |
| o | 23214 | 7.5% |
| n | 22973 | 7.5% |
| i | 22026 | 7.1% |
| r | 19934 | 6.5% |
| e | 17356 | 5.6% |
| s | 16661 | 5.4% |
| t | 14361 | 4.7% |
| u | 13948 | 4.5% |
| l | 13369 | 4.3% |
| Other values (17) | 93674 |
city_prefix
Text
| Distinct | 181 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | kab |
|---|---|
| 2nd row | tir |
| 3rd row | alg |
| 4th row | and |
| 5th row | lua |
| Value | Count | Frequency (%) |
| san | 1248 | 3.1% |
| por | 1248 | 3.1% |
| ban | 831 | 2.0% |
| bra | 624 | 1.5% |
| mon | 624 | 1.5% |
| man | 622 | 1.5% |
| bei | 417 | 1.0% |
| bel | 416 | 1.0% |
| kin | 416 | 1.0% |
| sai | 416 | 1.0% |
| Other values (172) | 33721 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 19209 | |
| n | 9381 | 7.7% |
| o | 8003 | 6.6% |
| r | 7872 | 6.5% |
| i | 7759 | 6.4% |
| s | 7743 | 6.4% |
| b | 7679 | 6.3% |
| m | 5881 | 4.8% |
| l | 5756 | 4.7% |
| u | 5660 | 4.7% |
| Other values (17) | 36680 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 121623 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 19209 | |
| n | 9381 | 7.7% |
| o | 8003 | 6.6% |
| r | 7872 | 6.5% |
| i | 7759 | 6.4% |
| s | 7743 | 6.4% |
| b | 7679 | 6.3% |
| m | 5881 | 4.8% |
| l | 5756 | 4.7% |
| u | 5660 | 4.7% |
| Other values (17) | 36680 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 121623 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 19209 | |
| n | 9381 | 7.7% |
| o | 8003 | 6.6% |
| r | 7872 | 6.5% |
| i | 7759 | 6.4% |
| s | 7743 | 6.4% |
| b | 7679 | 6.3% |
| m | 5881 | 4.8% |
| l | 5756 | 4.7% |
| u | 5660 | 4.7% |
| Other values (17) | 36680 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 121623 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 19209 | |
| n | 9381 | 7.7% |
| o | 8003 | 6.6% |
| r | 7872 | 6.5% |
| i | 7759 | 6.4% |
| s | 7743 | 6.4% |
| b | 7679 | 6.3% |
| m | 5881 | 4.8% |
| l | 5756 | 4.7% |
| u | 5660 | 4.7% |
| Other values (17) | 36680 |
country_prefix
Text
| Distinct | 157 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | afg |
|---|---|
| 2nd row | alb |
| 3rd row | alg |
| 4th row | and |
| 5th row | ang |
| Value | Count | Frequency (%) |
| mal | 1040 | 2.6% |
| bel | 804 | 2.0% |
| ind | 634 | 1.6% |
| tur | 624 | 1.5% |
| sai | 624 | 1.5% |
| uni | 624 | 1.5% |
| mon | 622 | 1.5% |
| bul | 560 | 1.4% |
| ira | 498 | 1.2% |
| slo | 464 | 1.1% |
| Other values (147) | 34047 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 14818 | |
| i | 8631 | 7.1% |
| u | 8604 | 7.1% |
| r | 8522 | 7.0% |
| n | 8366 | 6.9% |
| e | 8318 | 6.8% |
| m | 7680 | 6.3% |
| s | 7269 | 6.0% |
| o | 6494 | 5.3% |
| l | 5989 | 4.9% |
| Other values (17) | 36932 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 121623 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 14818 | |
| i | 8631 | 7.1% |
| u | 8604 | 7.1% |
| r | 8522 | 7.0% |
| n | 8366 | 6.9% |
| e | 8318 | 6.8% |
| m | 7680 | 6.3% |
| s | 7269 | 6.0% |
| o | 6494 | 5.3% |
| l | 5989 | 4.9% |
| Other values (17) | 36932 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 121623 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 14818 | |
| i | 8631 | 7.1% |
| u | 8604 | 7.1% |
| r | 8522 | 7.0% |
| n | 8366 | 6.9% |
| e | 8318 | 6.8% |
| m | 7680 | 6.3% |
| s | 7269 | 6.0% |
| o | 6494 | 5.3% |
| l | 5989 | 4.9% |
| Other values (17) | 36932 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 121623 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 14818 | |
| i | 8631 | 7.1% |
| u | 8604 | 7.1% |
| r | 8522 | 7.0% |
| n | 8366 | 6.9% |
| e | 8318 | 6.8% |
| m | 7680 | 6.3% |
| s | 7269 | 6.0% |
| o | 6494 | 5.3% |
| l | 5989 | 4.9% |
| Other values (17) | 36932 |
Interactions
Correlations
| air_quality_PM2.5 | humidity | latitude | longitude | precip_mm | temperature_celsius | |
|---|---|---|---|---|---|---|
| air_quality_PM2.5 | 1.000 | -0.176 | 0.055 | 0.180 | -0.282 | 0.002 |
| humidity | -0.176 | 1.000 | 0.094 | 0.200 | 0.328 | -0.277 |
| latitude | 0.055 | 0.094 | 1.000 | -0.051 | -0.113 | -0.638 |
| longitude | 0.180 | 0.200 | -0.051 | 1.000 | -0.033 | -0.229 |
| precip_mm | -0.282 | 0.328 | -0.113 | -0.033 | 1.000 | 0.033 |
| temperature_celsius | 0.002 | -0.277 | -0.638 | -0.229 | 0.033 | 1.000 |
Missing values
Sample
| country | location_name | latitude | longitude | last_updated | temperature_celsius | precip_mm | humidity | air_quality_PM2.5 | Country_normalized | City_normalized | city_prefix | country_prefix | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Afghanistan | Kabul | 34.52 | 69.18 | 2023-08-29 14:00 | 28.8 | 0.0 | 19 | 7.9 | afghanistan | kabul | kab | afg |
| 1 | Albania | Tirana | 41.33 | 19.82 | 2023-08-29 11:30 | 27.0 | 0.0 | 54 | 28.2 | albania | tirana | tir | alb |
| 2 | Algeria | Algiers | 36.76 | 3.05 | 2023-08-29 10:30 | 28.0 | 0.0 | 30 | 6.4 | algeria | algiers | alg | alg |
| 3 | Andorra | Andorra La Vella | 42.50 | 1.52 | 2023-08-29 11:30 | 10.2 | 0.0 | 51 | 0.5 | andorra | andorra la vella | and | and |
| 4 | Angola | Luanda | -8.84 | 13.23 | 2023-08-29 10:30 | 25.0 | 0.0 | 69 | 139.6 | angola | luanda | lua | ang |
| 5 | Antigua and Barbuda | Saint John's | 17.12 | -61.85 | 2023-08-29 05:30 | 29.0 | 0.3 | 79 | 0.8 | antigua and barbuda | saint johns | sai | ant |
| 6 | Argentina | Buenos Aires | -34.59 | -58.67 | 2023-08-29 06:30 | 9.0 | 0.0 | 71 | 2.1 | argentina | buenos aires | bue | arg |
| 7 | Armenia | Yerevan | 40.18 | 44.51 | 2023-08-29 13:30 | 31.0 | 0.0 | 26 | 5.0 | armenia | yerevan | yer | arm |
| 8 | Australia | Canberra | -35.28 | 149.22 | 2023-08-29 19:30 | 13.0 | 0.0 | 62 | 4.0 | australia | canberra | can | aus |
| 9 | Austria | Vienna | 48.20 | 16.37 | 2023-08-29 11:30 | 16.0 | 0.0 | 82 | 13.1 | austria | vienna | vie | aus |
| country | location_name | latitude | longitude | last_updated | temperature_celsius | precip_mm | humidity | air_quality_PM2.5 | Country_normalized | City_normalized | city_prefix | country_prefix | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 40531 | United Kingdom | London | 51.52 | -0.11 | 2024-03-28 16:00 | 10.0 | 0.07 | 82 | 0.9 | united kingdom | london | lon | uni |
| 40532 | United States of America | Washington Park | 46.60 | -120.49 | 2024-03-28 09:00 | 3.3 | 0.00 | 82 | 4.5 | united states of america | washington park | was | uni |
| 40533 | Uruguay | Montevideo | -34.86 | -56.17 | 2024-03-28 13:00 | 24.0 | 0.00 | 54 | 6.2 | uruguay | montevideo | mon | uru |
| 40534 | Uzbekistan | Tashkent | 41.32 | 69.25 | 2024-03-28 21:00 | 9.0 | 0.00 | 81 | 11.6 | uzbekistan | tashkent | tas | uzb |
| 40535 | Vanuatu | Port Vila | -17.73 | 168.32 | 2024-03-29 03:00 | 25.0 | 0.62 | 100 | 3.7 | vanuatu | port vila | por | van |
| 40536 | Venezuela | Caracas | 10.50 | -66.92 | 2024-03-28 12:00 | 28.3 | 0.00 | 39 | 1.9 | venezuela | caracas | car | ven |
| 40537 | Vietnam | Hanoi | 21.03 | 105.85 | 2024-03-28 23:00 | 25.0 | 0.00 | 94 | 78.6 | vietnam | hanoi | han | vie |
| 40538 | Yemen | Sanaa | 15.35 | 44.21 | 2024-03-28 19:00 | 19.6 | 0.45 | 51 | 5.5 | yemen | sanaa | san | yem |
| 40539 | Zambia | Lusaka | -15.42 | 28.28 | 2024-03-28 18:00 | 25.0 | 0.00 | 58 | 13.0 | zambia | lusaka | lus | zam |
| 40540 | Zimbabwe | Harare | -17.82 | 31.04 | 2024-03-28 18:00 | 24.4 | 0.00 | 45 | 25.1 | zimbabwe | harare | har | zim |